Text-Dependent Multilingual Speaker Identification using Learning Vector Quantization and PSO-GA Hybrid Model
نویسندگان
چکیده
---------------------------------------------------------------------***--------------------------------------------------------------------Abstract In this work a multilingual speaker identification system is proposed. The feature extraction techniques employed in system extract Mel frequency cepstral coefficient (MFCC), delta mel frequency cepstral coefficient (DMFCC) and format frequency. The feature selection is done using hybrid model of particle swarm optimizatiom (PSO) and Genetic Algorithm (GA). We have used Learning Vector Quantization (LVQ) artificial Neural Network classifiers. The speech database consists of 40 speakers (20 males+ 20 females) speech utterance. The speech utterance is recorded for a specific sentence in three different languages viz. “Now this time you go” (in English), “Adhuna Asmin Twam Gachh “(in Sanskrit), “Ab Iss Baar Tum Jao” (in Hindi). Total word for this purpose is 14 including 4 for Sanskrit and 5 Hindi and English. The average identification rate 79.99% is achieved when the Network is trained by LVQ and it shows 80.52% when LVQ is trained using hybrid PSO-GA model.
منابع مشابه
Text-Dependent Multilingual Speaker Identification using Back Propagation Neural Network and PSO-GA Hybrid Model
In this work a multilingual speaker identification system is proposed. The feature extraction techniques employed in the system extract Mel frequency cepstral coefficient (MFCC), delta mel frequency cepstral coefficient (DMFCC) and format frequency. The feature selection is done using hybrid model of particle swarm optimizatiom (PSO) and Genetic Algorithm (GA). We have used Back Propagation (BP...
متن کاملSpeaker Recognition Using Gaussian Mixtures Models
Speech signal contains several levels of information. At first it contains information about the spoken message. At second level speech signal also gives information about the speaker identity, his emotional state and so on. The task of speaker recognition can be divided into two parts: speaker identification and speaker verification. Speaker identification is answering the question which one o...
متن کاملSpeaker Identification From Youtube Obtained Data
An efficient, and intuitive algorithm is presented for the identification of speakers from a long dataset (like YouTube long discussion, Cocktail party recorded audio or video).The goal of automatic speaker identification is to identify the number of different speakers and prepare a model for that speaker by extraction, characterization and speaker-specific information contained in the speech s...
متن کاملModified Technique for Speaker Recognition using ANN
Speaker recognition consists of three phases: pre-processing, feature extraction and classification. During the first phase, the computer records the voice pattern of the speaker and analyse it. By the end of the second phase, the main features of the voice pattern are extracted. In the third phase, many classification techniques are exist such as artificial neural network (ANN) , hidden Markov...
متن کاملSpeaker identification using discriminative features selection
A new method of text-dependent speaker identification using discriminative feature selection is proposed in this paper. The characteristics of the proposed method are as follows: feature parameters extraction, vector quantization with the growing neural gas (GNG) algorithm, model building using gaussian distributions and discriminative feature selection (DFS) according to the uniqueness of pers...
متن کامل